智能论文笔记

Fast Topological Clustering with Wasserstein Distance

Tananun Songdechakraiwut , Bryan M. Krause , Matthew I. Banks , Kirill V. Nourski , Barry D. Van Veen

分类：机器学习

2021-11-30

许多真实网络展出的拓扑模式激励了基于拓扑的方法的发展，以评估网络的相似性。然而，提取拓扑结构是困难的，特别是对于节点度范围超过多个数量级的大型和密集网络。在本文中，我们提出了一种新颖的和计算实用的拓扑集群聚类方法，将复杂网络与复杂的拓扑结构从持续的同源性和最优运输中使用主治理论。这种网络通过基于其拓扑和几何结构的基于质心的聚类策略聚合到集群中，在不同网络中保留了节点之间的对应关系。拓扑接近和质心的概念使用新颖的和有效的方法来计算Wassersein距离和持久性条形码的持久条形码计算，以及与连接的分量和循环相关联的持久性条形码。所提出的方法被证明使用模拟网络和测量的功能性脑网络有效。

translated by 谷歌翻译

Deep Learning Based Detection of Enlarged Perivascular Spaces on Brain MRI

Tanweer Rashid , Hangfan Liu , Jeffrey B. Ware , Karl Li , Jose Rafael Romero , Elyas Fadaee , Ilya M. Nasrallah , Saima Hilal , R. Nick Bryan , Timothy M. Hughes

分类：计算机视觉 | 机器学习

2022-09-27

深度学习已在许多神经影像应用中有效。但是，在许多情况下，捕获与小血管疾病有关的信息的成像序列的数量不足以支持数据驱动的技术。此外，基于队列的研究可能并不总是具有用于准确病变检测的最佳或必需成像序列。因此，有必要确定哪些成像序列对于准确检测至关重要。在这项研究中，我们旨在找到磁共振成像（MRI）序列的最佳组合，以深入基于学习的肿瘤周围空间（EPV）。为此，我们实施了一个有效的轻巧U-NET，适用于EPVS检测，并全面研究了来自易感加权成像（SWI），流体侵入的反转恢复（FLAIR），T1加权（T1W）和T2的不同信息组合 - 加权（T2W）MRI序列。我们得出的结论是，T2W MRI对于准确的EPV检测最为重要，并且在深神经网络中掺入SWI，FLAIR和T1W MRI可能会使精度的提高无关。

translated by 谷歌翻译

NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation

Kaustubh D. Dhole , Varun Gangal , Sebastian Gehrmann , Aadesh Gupta , Zhenhao Li , Saad Mahamood , Abinaya Mahendiran , Simon Mille , Ashish Srivastava , Samson Tan

分类：自然语言处理 | 人工智能 | 机器学习

2021-12-06

数据增强是自然语言处理（NLP）模型的鲁棒性评估的重要组成部分，以及增强他们培训的数据的多样性。在本文中，我们呈现NL-Cogmenter，这是一种新的参与式Python的自然语言增强框架，它支持创建两个转换（对数据的修改）和过滤器（根据特定功能的数据拆分）。我们描述了框架和初始的117个变换和23个过滤器，用于各种自然语言任务。我们通过使用其几个转换来分析流行自然语言模型的鲁棒性来证明NL-Upmenter的功效。基础架构，Datacards和稳健性分析结果在NL-Augmenter存储库上公开可用（\ url {https://github.com/gem-benchmark/nl-augmenter}）。

translated by 谷歌翻译

Neuronal Learning Analysis using Cycle-Consistent Adversarial Networks

Bryan M. Li , Theoklitos Amvrosiadis , Nathalie Rochefort , Arno Onken

分类：机器学习

2021-11-25

了解任务学习后神经电路中的活动如何重新成像，可以揭示学习的基本机制。由于神经成像技术的最近进步，高质量的记录可以在多天甚至几周内从数百个神经元获得。然而，人口响应的复杂性和维度对分析构成了重大挑战。研究神经元适应和学习的现有方法通常对数据或模型产生强烈的假设，导致不概括的偏置描述。在这项工作中，我们使用一个叫做 - Cycleangan的深度生成模型的变种，了解预先和后学后神经活动之间的未知映射，记录了$ \ texit {vivo} $。我们开发一个端到端的管道到预处理，火车和评估荧光信号，以及解释所得到的深度学习模型的过程。为了评估我们方法的有效性，我们首先在具有已知地面实话转换的合成数据集中测试我们的框架。随后，我们将我们的方法应用于从初级视觉皮层记录的表现小鼠记录的神经活动，其中小鼠从新手转换到基于视觉的虚拟现实实验中的专家级性能。我们评估了产生的钙信号的模型性能及其推断的尖峰列车。为了最大限度地提高性能，我们推导了一种新的预选神经元方法，使得基于卷积的网络可以利用神经活动中存在的空间信息。此外，我们还纳入了视觉解释方法，以提高我们工作的可解释性，并进入学习过程中的洞察力，表现在细胞活动中。我们的结果表明，分析具有数据驱动的深度无监督方法的神经元学习过程，其可能以不偏不倚的方式解开变化的可能性。

translated by 谷歌翻译

CytoImageNet: A large-scale pretraining dataset for bioimage transfer learning

Stanley Bryan Z. Hua , Alex X. Lu , Alan M. Moses

分类：计算机视觉 | 人工智能

2021-11-23

动机：近年来，基于形象的生物测定稳步成为高吞吐量，引发了快速自动化方法，以提取来自数百种图像的生物学有意义的信息。从想象成的成功取得灵感，我们驯服细胞造就花，一个公开源和弱标记的显微镜图像的大规模数据集（890K图像，894级）。预先训练的细胞造黄养箱产生了对上游显微镜分类任务的想象成特征具有竞争力的功能。我们展示了CytoImAgenet的证据表明，CytoImAgenet在想象中训练有素的功能中捕获信息不可用。数据集是在https://www.kaggle.com/stanleyhua/cyaagenet中提供的。

translated by 谷歌翻译

Quantifying Topology In Pancreatic Tubular Networks From Live Imaging 3D Microscopy

Kasra Arnavaz , Oswin Krause , Kilian Zepf , Jelena M. Krivokapic , Silja Heilmann , Jakob Andreas Bærentzen , Pia Nyeng , Aasa Feragen

分类：计算机视觉 | 机器学习

2021-05-20

由胰腺管网络的具有挑战性的分割任务激发，本文解决了两个通常遇到生物医学成像问题的问题：分割的拓扑一致性，以及昂贵或困难的注释。我们的贡献如下：a）我们提出了一个拓扑评分，该评分衡量了预测和地面真理分割之间的拓扑和几何一致性，应用于模型选择和验证。 b）我们在时间序列图像数据上为这一困难的嘈杂任务提供了完整的深度学习方法。在我们的方法中，我们首先使用半监管的U-NET体系结构，适用于通用分割任务，该任务共同训练自动编码器和分割网络。然后，随着时间的流逝，我们使用循环的跟踪来进一步改善预测的拓扑。这种半监督的方法使我们能够利用未经通知的数据来学习特征表示，尽管我们的带注释的培训数据的变化非常有限，但该特征表示具有较高可变性的数据。我们的贡献在具有挑战性的分割任务上得到了验证，从嘈杂的实时成像共聚焦显微镜中定位胎儿胰腺中的管状结构。我们表明，我们的半监督模型不仅优于完全监督和预训练的模型，而且还优于在训练过程中考虑拓扑一致性的方法。此外，与经过平均循环得分为0.762的CLDICE的U-NET相比，我们的方法的平均环路得分为0.808。

translated by 谷歌翻译

Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models

Bryan A. Plummer , Liwei Wang , Chris M. Cervantes , Juan C. Caicedo , Julia Hockenmaier , Svetlana Lazebnik

分类：

2015-05-19

The Flickr30k dataset has become a standard benchmark for sentence-based image description. This paper presents Flickr30k Entities, which augments the 158k captions from Flickr30k with 244k coreference chains, linking mentions of the same entities across different captions for the same image, and associating them with 276k manually annotated bounding boxes. Such annotations are essential for continued progress in automatic image description and grounded language understanding. They enable us to define a new benchmark for localization of textual entity mentions in an image. We present a strong baseline for this task that combines an image-text embedding, detectors for common objects, a color classifier, and a bias towards selecting larger objects. While our baseline rivals in accuracy more complex state-of-the-art models, we show that its gains cannot be easily parlayed into improvements on such tasks as image-sentence retrieval, thus underlining the limitations of current methods and the need for further research.

translated by 谷歌翻译

Gaussian Process Optimization in the Bandit Setting: No Regret and Experimental Design

Niranjan Srinivas , Andreas Krause , Sham M. Kakade , Matthias Seeger

分类：

2009-12-21

Many applications require optimizing an unknown, noisy function that is expensive to evaluate. We formalize this task as a multiarmed bandit problem, where the payoff function is either sampled from a Gaussian process (GP) or has low RKHS norm. We resolve the important open problem of deriving regret bounds for this setting, which imply novel convergence rates for GP optimization. We analyze GP-UCB, an intuitive upper-confidence based algorithm, and bound its cumulative regret in terms of maximal information gain, establishing a novel connection between GP optimization and experimental design. Moreover, by bounding the latter in terms of operator spectra, we obtain explicit sublinear regret bounds for many commonly used covariance functions. In some important cases, our bounds have surprisingly weak dependence on the dimensionality. In our experiments on real sensor data, GP-UCB compares favorably with other heuristical GP optimization approaches.

translated by 谷歌翻译

Conservation Tools: The Next Generation of Engineering--Biology Collaborations

Andrew Schulz , Cassie Shriver , Suzanne Stathatos , Benjamin Seleb , Emily Weigel , Young-Hui Chang , M. Saad Bhamla , David Hu , Joseph R. Mendelson III , .

分类：机器学习

2023-01-03

The recent increase in public and academic interest in preserving biodiversity has led to the growth of the field of conservation technology. This field involves designing and constructing tools that utilize technology to aid in the conservation of wildlife. In this article, we will use case studies to demonstrate the importance of designing conservation tools with human-wildlife interaction in mind and provide a framework for creating successful tools. These case studies include a range of complexities, from simple cat collars to machine learning and game theory methodologies. Our goal is to introduce and inform current and future researchers in the field of conservation technology and provide references for educating the next generation of conservation technologists. Conservation technology not only has the potential to benefit biodiversity but also has broader impacts on fields such as sustainability and environmental protection. By using innovative technologies to address conservation challenges, we can find more effective and efficient solutions to protect and preserve our planet's resources.

translated by 谷歌翻译

Meta-learning generalizable dynamics from trajectories

Qiaofeng Li , Tianyi Wang , Vwani Roychowdhury , M. Khalid Jawed

分类：机器学习

2023-01-03

We present the interpretable meta neural ordinary differential equation (iMODE) method to rapidly learn generalizable (i.e., not parameter-specific) dynamics from trajectories of multiple dynamical systems that vary in their physical parameters. The iMODE method learns meta-knowledge, the functional variations of the force field of dynamical system instances without knowing the physical parameters, by adopting a bi-level optimization framework: an outer level capturing the common force field form among studied dynamical system instances and an inner level adapting to individual system instances. A priori physical knowledge can be conveniently embedded in the neural network architecture as inductive bias, such as conservative force field and Euclidean symmetry. With the learned meta-knowledge, iMODE can model an unseen system within seconds, and inversely reveal knowledge on the physical parameters of a system, or as a Neural Gauge to "measure" the physical parameters of an unseen system with observed trajectories. We test the validity of the iMODE method on bistable, double pendulum, Van der Pol, Slinky, and reaction-diffusion systems.

translated by 谷歌翻译